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1. INTRODUCTION 


A software interface specification, for the Highlander primary core 2D routines is proposed. 
The interface specifies the primary core primitives and associated host-memory parameter 
blocks used by the Windows 95 graphics device driver to accelerate a range of 2D operations. 


Ze PRIMARY CORE 2D FUNCTIONALITY - AN OVERVIEW 


Primary core routines are provided that perform scaled and unscaled rectangular image 
transfers and multiple image transfers (text), arbitrary angle line and polyline drawing, ellipse 
(circle) drawing and composite drawing and filling operations like polygons, pies and chords. 


Previously loaded by the Kernel Manager, the code fragments comprising the Primary Core 
2D functionality run on a background thread which blocks on an internal 8 bit register which 
is visible in the PCI configuration space. This register is accessible from Host space and its 
address is returned by the Kernel Manager in a structure which also contains the execution 
address of the primary core code fragment to run when, after a host write to the ‘kicker’ 
location the thread deblocks. For 2D processes this execution address is that of a primary 
core code fragment that scans a host circular buffer which contains the the 2D operations and 
their associated parameters. Two host and primary code accessible dwords contain the Input 
and output pointers for the circular command and parameter buffer. The pointers are used by 
the primary core for 2D command extraction and by the host for command insertion. Whilst 
there are commands in the buffer the primary core buffer processor extracts the 2D 
commands and dispatches them to their primary core code handlers. When the buffer is 
empty the thread returns to blocking on the kicker semaphore. The asynchronicity between 
host and primary core disallows the use of status information to inform the host whether or 
not the thread is blocked or running. As such, each command insertion must be accompanied 
by a write of the execution address and a write to the kicker location. The width of the 
primary core counter linked to the kicker location places a limit on the number of sequential 
‘re-kicks’ that can be performed and given competion for this counter from direct draw, 
direct 3D and video capture drivers it is suggested that a maximum of 32 Host commands be 
inserted into the buffer at any one time.Status information is maintained by the primary core 
such that the host may know when the 2D pixel pipeline is empty and drawing has ceased so 
that it may read the frame buffer after completion of all outstanding 2D commands. 


Described in this document are the parameter sets comprising a 2D operation that would be 
inserted into the command buffer by the Host driver. 


The interface provided by the primary core is closely coupled to the functionality provided by 
the secondary (2D) core. As such, the associated parameter set is predicated upon the need 
for specific hardware optimisations for the wide variety of possible 2D operations, even 
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within a single class of operation like BitBlt, along with a consideration of the most 
advantageous host-driver/primary core functional split. These considerations yield a 
parameter set that consists of selected portions of the host drivers GDI-supplied parameter set 
with additional information which may specify primary core routines specific to the 2D op 
being handled and may include complex addressing parameters, the calculation for which 
may be too slow or impossible on the primary core. 


As a part of the software interface and placed within the command buffer by the host driver 
are the primary core executable addresses for the 2D operation handlers, for example Blt and 
Text. In addition to these executable addresses there may be others, subfunctions of the main 
2D op handler that may be required. These addresses are extracted by the host driver at init 
time via the Kernel Manager API. Pointer references to data structures inserted in the 
command buffer must refer to locked, contiguous host or framebuffer memory. The Kernel 
Manager can allocate and lock memory dynamically for this purpose but it is anticipated that 
both the command buffer memory and any other locked host areas (for colour patterns or text 
glyphs, for example) that are accessed by Highlander will be statically allocated by the VDD 
at Windows boot time. Note GlobalAlloc cannot be used because Highlander must be 
guaranteed contiguous physical memory. The Kernel Manager dynamic memory allocation 
finds use in the offscreen cacheing of bitmaps at RealizeObject time. 


2.1. 2D Circular Command Buffer Management 


All blts are communicated to the primary core via. a large (64K) area of memory known as 
the circular command buffer. This behaves as a conventional cyclic queue, with a read and a 
write index into the buffer. These indices are dwords pointed to by pdwRoff and pdwWoff. 
They live in the cache coherent region of host memory (though the primary core keeps a local 
copy of the read offset, and updates the host version every now and again). When you write 
stuff into the buffer you modify the write index accordingly. Note that both of these are 
offsets in bytes, and there are macros for updating the pointers and checking for free space (in 
kmapi.h). There is a 4K overflow buffer on the end of the normal 64K, so you don’t have to 
split (small) commands up when you reach the end of the buffer. 


Commands contain a primary core exe address field and a command size field (measured in 
bytes), at the head of their data, followed by the particular command. 


The sequence for performing a blt is: 


1) Exchange | with the dwAccess field of the circular command buffer 

2) repeat until the result of the exchange is 0 

3) Call GET_PCCCB_SPACE(*pdwRoff, *pdwWofff) 

4) If there is insufficient space then wait or split the command 

5) If your command may be > 4K then you must test to make sure that Woff + 
CommandSize < 68K. If its not then you must split the command into 4K chunks. (or you 
could write a dummy command to force wrap around etc.) 
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6) Write the command into the buffer at the Write offset. 

7) Call the UPDATE_PCCB_ WOFF(CommandSize, *pdwWoff) macro (which does Woff = 
(Woff + CommandSize) & 65535) 

8) Hit the kicker register for the BGO thread. (write a 1 into it) 

9) Write a zero into the dwAccess field of the circular command buffer 


Commands may be agglomerated together by writing them all at once, but you must 
remember to make sure that they meet the 4K condition 5. And write the number of 
agglomerated commands into the kicker register, rather than just a 1. 


Note that all commands must be a multiple of 64bytes in size (though if its smaller than this 
you don’t actually have to write out the unused data) because of various cacheing issues on 
the primary core. 


There are two ‘special’ commands: 
NoCmd: 


NO-OP 2D command handler. The size field is used to tell the primary core how much to 
update its read pointer by. 


Status: 


This allows us to determine the completion status of the preceding commands. It takes three 
parameters: the address to write to and the value to write (ignored on 1B) when the 
preceeding operations complete. It also takes a flag indicating whether or not we should flush 
any pipelines after the blt. [You'd have to set this if you are using the status command as a 
synchronization point rather than indicating the ability to reuse a cache entry]. There is also a 
facility to halt the primary core until the flush has actually completed. 


If you’re initialising your driver for the first time, you can set the position of the read and 
write offset pointers using KMSetCmndBufferInfo. 

De PRIMARY CORE 2D SOFTWARE INTERFACE SPECIFICATION 

3.1 Buffer and Memory Management and the NOP and Status commands 


See § 2.1 for description. 
3.1.1 The NOP Command 


Byte Command Buffer 
Offset Field (32bit wide) Comments 


Loo Size of parameter block in bytes (decimal) 


04 [NoCmd] Execution address of 1Core NOP function 


HILK028 Version Draft I 12 November 1998 


1oS) 


1deoLogic onfidentia 
; VidcoLogic VidseLape Contents) 


Highlander 2D Graphical Operations 
Primary Core Software Interface 


3.1.2 The Status Command 


Byte Command Buffer 
Offset Field (32bit wide) Comments 


foo foes Size of parameter block in bytes 
Execution address of 1 Core Status function 


[DoneStatusPtr] Address of completion status DWORD. The primary core sets 
the [DoneStatusValue] on completion of all previous blts 
[DoneStatus Value] 


10 [Flags] 


24-bit value to write on completion 


0 = indicate completion, 1 = indicate and flush 


2 = indicate, flush and wait 


3.1.3 The Memory Move Command 


Byte Command Buffer 
Offset Field (32bit wide) Comments 


00_|12+Blocks*12__| Size of parameter block in bytes 
(08 _| [NumBlocks} __| Number of memory blocks to move, each has 3 DWORDS _| 
[Source 
Size of transfer in bytes 


3.2 BitBlt 


Hardware limitations and optimisations yield 24 different forms of ternary ROP-governed 
rectangular image transfer. The forms are dependent on the ROP and the type of source and 
pattern data and consist of the following: 


(1) Blt without destination read (BIt) 

(2) Blt with destination read (DBIt) 

(3) Mono source blt without destination read (mSBIt) 
(4) Mono source bit with destination read (mSDBIt) 
(5) Colour source blt without destination read (cSBIt) 


(6) Colour source blt with destination read (cSDBIt) 
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(7) Mono pattern blt without destination read (mPBIt) 

(8) Mono pattern blt with destination read (mPDBIt) 

(9) Solid pattern blt without destination read (sPBIt) 

(10) Solid pattern blt with destination read (sPDBIt) 

(11) Colour pattern blt without destination read (cPBIt) 

(12) Colour pattern blt with destination read (cPDBIt) 

(13) Mono source mono pattern blt without destination read (mSmPBIt) 
(14) Mono source mono pattern bit with destination read (mSmPDBIt) 
(15) Mono source solid pattern blt without destination read (mSsPBIt) 
(16) Mono source solid pattern blt with destination read (mSsPDBIt) 
(17) Colour source mono pattern blt without destination read (cSmPBIt) 
(18) Colour source mono pattern blt with destination read (CSmPDBIt) 
(19) Colour source solid pattern blt without destination read (cSsPBIt) 
(20) Colour source solid pattern blt with destination read (cSsPDBIt) 
(21) Mono source colour pattern blt without destination read (mScPBIt) 
(22) Colour source colour pattern blt without destination read (cScPBIt) 
(23) Mono source colour pattern blt with destination read (mScPDBIt) 


(24) Colour source colour pattern blt with destination read (cScPDBIt) 


Appendix I specifies the particular 1Core execution address to setup given the GDI supplied 
ROP, source and pattern data. Note that the source/destination bitmap addresses represent the 
actual data to be transferred, i.e. clipping is performed by the host driver. 


3.2.1 Blt with & w/out destination read (DBIt & Blt) 


Byte Command Buffer 
Offset Field (32bit wide) Comments 


64 Size of parameter block in bytes rounded up to 64 
04 [Blt] or [DBIt] Execution address of 1Core bitblt function 


los | DestPhysBase] Physical base address of frame buffer bitmap 


0C 
10 
14 


hia | tsizexy] (Size X << 16) | (Size Y) 


18 


[ 
[ 

j10_|testxy) | Dest X posn << 16) |(Pixel Yposm) 
(SieX<<19|(SieeY) 
[ 


ROP] ROP 


HILK028 Version Draft I 5 12 November 1998 


; VidcoLogic VideoLogic Confidential 


Highlander 2D Graphical Operations 
Primary Core Software Interface 


= 


3.2.2 Buffer for cPBIt & cPDBIt 


00 [61 ____| Size orparameter block in bytes rounded upto64 
[08 | [DestPhysBase] | Dest Physical Base address 
C_| Desistride 
[ROPRotate 
[ColParStat 


3.2.3 Buffer for sPBIt & sPDBIt 


loo foe Size of parameter block in bytes rounded up to 64 


04 Primary Core execute address 
Dest Physical Base address 
Dest Stride << 16 
Dest (X pixel posn << 16) | (Y pixel posn) 
[SizeX Y ] (Size X << 16) | (Size Y) [in pixels] 
ROP 


[SolidColour] Solid pattern colour 


3.2.4 Buffer for cSBIt & cSDBIt 


00 [64 __| Size of parameter block in bytes rounded upto64 
(08 | DestPhysase} | Dest Physical Base address 
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[SrcBaseAddress] Source Bitmap Base address 


The SrcByteOffset should contain the offset (in bytes) between the top left and bottom left 
corners of the sre region + 1, if a backwards blt is required. Note that this restricts us to blts 
smaller than 16Meg. If a backwards blt is not required, then this field should be set to zero. 


3.2.5 Buffer for All Other Blts 


04 [BltName] Primary Core execute address 
[ 


Size of parameter block in bytes rounded up to 64 


DestPhysBase]| Dest Physical Base address 
[SrcDestStride] (Dest Stride In Bytes << 16) | (Sre Stride In Bytes) 


[DestX Y] Dest (X posn in pixels << 16) | (Y posn in pixels) 
[SizeX Y] Size X in pixels << 16) | (Size Y in pixels) 
[RotROP] yrot << 19) | (xrot << 16) | ROP 


1C_ | [MonoSrcOffset] or | See below! 
[SrcByteOffset] 


[ColPatBase] Colour pattern physical base, or solid pattern colour 
or [SolidPatCol] 


For a mono src bit, the MonoSrcOffset field contains the offset in bits from the SrcPhysBase. 
(a number between 0 and 7) 


_ 
oo 
=> 


—_ |} |o 
RISCI1A 


— 


For a colour source bit, the SrcByteOffset field contains the byte offset of the lower left 
corner of the blt from the upper left corner + 1, if a backward blt is required. If the blt does 
not need to be done backwards, then this field must be zero. 
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3.3 Text 


The 1Core Text function is Windows 95 specific in that it assumes a byte packed format for 
the glyphs and that the font is a small font. The specificity is summed up in that the 
GlyphInfo buffer entries are composed of the 4, byte-sized, SMALLROWGLYPH structure 
elements concatenated into the DWORD field entry. It is a minor extention to the Icore code 
and to this interface to accept glyphs defined by the LARGEROWGLYPH structure and even 
the WindowsNT GLYPHBITS structure. These structures all define the x and y offsets of the 
glyph character from the glyph bounding box and the number of x and y pixels comprising 
the glyph. The difference in the structures is the size of the data type used for the purpose 
(8bit BYTEs in SMALLROWGLYPH, 16bit WORDs for LARGEROWGLYPH and 32bit 
ULONGS for GLYPHBITS). Since Highlander performs bit addressing of monochrome 
bitmaps it is possible to provide a Text function that uses bit-packed glyphs. Theoretically 
this is the way to go. 


The structure of this interface assumes some form of glyph cacheing such that the physical 
addresses of same are readily generated from the GDI requested list of character indices. 
Clearly the whole string may be dispatched to the lcore via this interface, however graphics 
driver development has shown that the text bounding box parameter sent to ExtTextOut can 
be used to effectively reduce the actual number of glyphs processed. 


Byte Command Buffer 
Offset Field (32bit wide) Comments 


48+12N [64] Size of parameter block in bytes (N = 
NumCharacters) -- rounded to next highest multiple 
of 64 


loa sd Text sd Execution address of 1 Core bitblt function 
et [DestPhysBase] Physical base address of frame buffer bitmap 


[DestStride] : Bits 1 - 31 : DestStride in pixels 


[Transparency] Bit 0 : Transparency. 0 = opaque, 1 = transparent 


[DestXPixelAddr] Dest bitmap X pixel address 
— [DestY PixelAddr] Dest bitmap Y pixel address 


[ClipTop]:[ClipLeft] ClipRegion top pixel addr (in hi word) and left pixel 
addr 


[ClipBot]:[ClipRight] | ClipRegion bottom pixel addr (in hi word) and right 
pixel addr 

[ForegroundColour] Physical colour specified by TextColor member of 
DrawMode struct 
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[BackgroundColour] Physical colour specified by bkColor member of 
DrawMode struct 


[NumCharacters] Number of text glyphs to draw 
[PhysicalBaseGlyph1] | Physical base address of glyph1 
i 


Interglyph width 1 


2C+8N PhysicalBaseGlyphN] | Physical Base Address of glyph N 
Interglyph width N 
@ = zersoven | PhyswriteBackadde | 


3.4 Scaled BitBIt 


The 2D core provides functionality for the following subset of StretchBlt operations: 


(1) Single source (ROP = SRCCOPY) scaled bit at 1 pix/clock (cSScl) 
(2) Sre and Dst without mono Pat scaled blt at 1 pix/2 clock (cSDScl) 

(3) Sre and Dst with mono Pat scaled bit at 1 pix/2 clock (cCSmPDScl) 

(4) Src, Dst and colour Pat scaled blt at 1 pix/2 clocks (cScPDScl) 


= As with BitBlt the 1Core interface to use is determined from the ROP and pattern type (if 
present). Scaling of mono sources is not supported. Note that in cases (3) and (4) the 
SRCCOPY ROP can be used but there is no increase in speed. This is because the routines 
perform a read on the destination even if this data is not actually required by the ROP. As 
with BitBIt, the source/destination bitmap addresses represent the actual data to be 
transferred, i.e. clipping is performed by the host driver. 


3.4.1 Single source (ROP = SRCCOPY) scaled bit (cSScl) 
& Src and Dst without mono Pat scaled bit (cSDScl) 


Byte Command Buffer 
Offset Field (32bit wide) Comments 


108 | [DestPhysBase] __| Physical base address of frame buffer bitmap | 
Dest bitmap X pixel address 
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[DestYPixelAddr] 
[XDstPixelExtent] 


[YDstPixelExtent] Y pixel extent 


[SrcPhysBase] Physical base byte address of source bitmap 


[SrcByteOffsetAddr] Src bitmap byte address offset from physical base (offset 


Dest bitmap Y pixel address 


X pixel extent 


14) (16bit value) 


[SrcBitmapByteStride] byte length between pixels on successive rows of src 
bitmap 


[XSrePixelExten] 


3C | [YSrcInc]:[YSreIncFrac] | Y pixel integer & fractional addr increments (via 
(Src YExt<<16)/DstYExt) 
40 [ROP3] Ternary ROP code = SRCCOPY, NOTSRCCOPY only 
for cSScl 


3.4.2 Sre and Dst with mono Pat scaled blt ((SmPDScl) 


Byte Command Buffer 
Offset Field (32bit wide) Comments 


foo fis Size of parameter block in bytes rounded up to 64 
Execution address of 1 Core bitblt function 

08 | [DestPhysBase] Physical base address of frame buffer bitmap 
Dest bitmap X pixel address 

Dest bitmap Y pixel address 

[XDstPixelExtent] X pixel extent 

Y pixel extent 

Physical base byte address of source bitmap 


[SrcByteOffsetAddr] 


24 [SrcBitmapByteStride] 


Sre bitmap byte address offset from physical base (offset 
14) (16bit value) 


byte length between pixels on successive rows of src 
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ne 
[SreYPixel Adar 
[XSrePixelE tent] 
[YSrePixelExtent 


38 [XSreInc]:[XSreIncFrac] | X pixel integer & fractional addr increments (via 
(SrceXExt)<<16/DstXExt) 
3C_ | [YSreInc]:[YSreIncFrac] | Y pixel integer & fractional addr increments (via 
(Src YExt<<16)/DstYExt) 


[48 | tatiermxrotateComty | 
4c | frawemyRowteCouny | 


50 [PatForegroundColour] Physical colour specified by fgColor member of 
PBRUSH struct 
54 [PatBackgroundColour] | Physical colour specified by bkColor member of 
PBRUSH struct 
8 


[ROP3] Ternary ROP code 


3.4.3. Sre and Dst with colour Pat scaled blt (¢ScPDScl) 


Byte Command Buffer 
Offset Field (32bit wide) Comments 


s 


08 _|[DestPhysBase]___| Physical base address offame buffer bitmap 
IC 
0 I 


2 SrcByteOffsetAddr] Src bitmap byte address offset from physical base (offset 
14) (16bit value) 


bitmap 
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Sre bitmap Y pixel address 


38 [XSrcInc]:[XSrcIncFrac] | X pixel integer & fractional addr increments (via 
(SrcXExt)<<16/DstXExt) 
| 


] 
3C [YSreInc]:[YSreIncFrac] | Y pixel integer & fractional addr increments (via 
(Src YExt<<16)/DstYExt) 


[PatPhysBase] Physical base address for colour pattern 


[PatternXRotateCount] 


[PattemYRotateCount} | 
[ROP3] Ternary ROP code 


3.5 DibBit 


We currently have routines written which perform depalettization for a four or eight bit 
sources, and which can blt a source of 16, 24, or 32bpp into a 16, 24, or 32 bit destination, 
using either SRCCOPY or NOTSRCCOPY rops. 


3.5.1 4/8 bpp Src Blt (depal) 


Byte Command Buffer 
Offset Field (32bit wide) Comments 


(04 | fadepatbiy _| Execution address of 1Core bitblt function 
108 | tDestPhysBase] __| Physical base address of frame buffer bitmap 
10 | ateweBase) _| Physical base address ofpalete 
14 |rop) | Rop code (SRCCOPY or NOTSRCCOPY) 


04 
1 
1 
1 


[PaletteEntryBytesPerPi | Bytes per pixel for each palette entry. This must be 4, or 
xel] must match the destination surface bytes per pixel 


28 [DstX : DstY] Position (in pixel) to blt to 


HILK028 Version Draft I 


20 [SrcBitOffset] Initial offset into the source, in bits 
2 . Thi ; 
2 


0 

4 

C | [DstStride : (Dest stride in bytes << 16) | SrcStride in Bytes 
SrcBitmapByteStride] 

4 

8 
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[Xsize : Ysize] Size of blt to perform (in pixels) 


This blt allows one to bit from a4 or 8 bit source surface into the current frame buffer bit 
depth. 


This blt performs depalettization at (nominally) 1 pixel every two clocks. The bytes per pixel 
for a palette entry must be either 4 bytes per pixel, or else the destination bit depth for this blt 
to work correctly. If it is quicker on the host, then we may decide to combine the 
[PaletteIndexBitspp] and [PaletteEntryBytesPerPixel] fields. 


3.5.2 Non-palettized DIB bit 


Byte Command Buffer 
Offset Field (32bit wide) Comments 


[dibBIt] 
[DestPhysBase] 


SrcPhysBase] 


[ 


SrcBytesPerPixel] 


L 
[SreX : SrcY] 
[ 


DstX : DstY] 


[Xsize : Ysize] 


00 | 
08 


This blt allows one to blt from a 16, 24, or 32 bit source surfaces into the current frame buffer 
bit depth (the frame buffer depth is statically encoded into the display driver primary core 
fragment). Blts into an 8bit destination are forbidden. This routine blts (nominally) two 
pixels per cycle. 


3.6 Scaled DibBlt 


Currently no primary core routine exists to perform these blts. Indeed it is impossible with 
the 2D core hardware to perform a single pass (i.e. without writing intermediate results to 
memory) scaled and depalettised blt of a 4bpp source. It is suggested that a two pass solution 
be employed in which an unscaled dibblt of the type of 3.4.1 (c4SBIt) or 3.4.2 (c8SBIt) is 
performed, using the SRCCOPY or NOTSRCCOPY ROP, to an offscreen location followed 
by a scaled bit of the type of 3.3.1 (cSScl), with aSRCCOPY ROP, using that offscreen 
location as the source for the bit. 
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3.7 Line Drawing 


The functionality provided by the 2D core in respect of lines covers arbitrary angle solid and 
styled lines which utilise a Bresenham line drawing algorithm. This is also the functional set 
provided by the 1Core with the exception of polylines. Hardware limitations require that 
transparent lines are not handled by the 2D core. Windows GDI uses binary ROPs to 
combine pen and destination data. The host driver is expected to maintain a binary to ternary 
ROP translation table which is of the form: 


Ternary ROP Binary ROP 

00h (DDx) (1) R2_ BLACK 

05h (DPon) (2) R2 NOTMERGEPEN 
0OAh (DPna) (3) R2_ MASKNOTPEN 
OFh (Pn) (4) R2 NOTCOPYPEN 
50h (PDna) (5) R2_ MASKPENNOT 
55h (Dn) (6) R2_ NOT 

5Ah (DPx) (7) R2_XORPEN 

5Fh (DPan) (8) R2 NOTMASKPEN 
AOh (DPa) (9) R2_ MASKPEN 

A5h (DPxn) (10) R2 NOTXORPEN 
AAh (D) (11) R2_NOP 

AFh (DPno) (12) R2 MERGENOTPEN 
FOh (P) (13) R2_COPYPEN 

F5h (PDno) (14) R2 MERGEPENNOT 
FAh (DPo) (15) R2_ MERGEPEN 

FFh (DDxn) (16) R2_ WHITE 


The horizontal and vertical lines are effectively unit dimension PatBlts where the pattern 
used equates to one of the 5 Windows defined Linestyles. Defined below are the Linestyles 
and their representations in the 2D core mono pattern registers: 


LineDraw Pattern Lo and Hi Registers 
LS_SOLID (0) OxFFFFFFFF (hi) OxFFFFFFFF (lo) 
LS_ DASHED (1) OxFFOOFFFF OxFFOOFFFF 
LS_DOTTED (2) OxFOFOFOFO OxFOFOFOFO 

LS _DOTDASHED § (3) OxFFOOOFFO OxFFOOOFFO 
LS_DASHDOTDOT (4) OxFFFOFOFO OxFFFOFOFO 


It is expected that the host driver will maintain tables to relate linestyle to pattern register 


contents. 
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Horizontal and Vertical Lines 


The H and V line drawing operations are divided into the following catagories for 
performance reasons: 


(1) zero read cycle line draw using binary ROPs R2_ BLACK, R2_ WHITE (HLine, VLine) 


(2) zero read cycle line draw with mono Pat (ROPs = R2_(NOT)COPYPEN) 
(mPHLine,mPV Line) 


(3) one read cycle line draw with Dst using R2_ NOT (destination invert) (DHLine, DVLine) 


(4) one read cycle line draw with Dst and mono Pat (all ROPs not covered by above) 
(mPDHLine, mPDVLine) 


Whilst the line can be passed to the Icore as delivered by GDI perfomance increases may 
obtain for lines already clipped by the host driver. 


3.7.1 Line draw using R2. BLACK , R2. WHITE & R2_NOT (H/VLine, DH/VLine) 
Byte Command Buffer 


Offset Field (32bit wide) Comments 


‘a Size of parameter block in bytes rounded up to 64 


04 [H/VLine] or Execution address of 1Core HV linedraw function 
[DH/VLine] 


[YoPixelAddr 
[XOPixel Addr] 
[Y IPixelAddr] 
[ 
[ 


a 
££ 


o) 


— — oS 
on) 


X1PixelAddr] X1 pixel address 
ClipTop]:[ClipLeft ClipRegion top pixel addr (in hi word) and left pixel addr 


‘(Cli | 
18 ClipBot]:[ClipRight] | ClipRegion bottom pixel addr (in hi word) and right pixel 
addr 
| 


aw 


ve) 
oe) 
aE, 
1S) 


Ternary ROP code corresponding to R2_ BLACK 
R2_ WHITE for H/VLine or corresponding to R2_ NOT for 
D/VHLine 


[Stride] Stride in bytes 


3.7.2 Line draw using R2_ COPYPEN and R2_ NOTCOPYPEN (mPH/VLine) 
& Line draw involving Dst and Pat (mPdH/VLine) 


[ 
ic |[ 
0 


2 


Byte Command Buffer 
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Offset Field (32bit wide) Comments 


loo foe siSY Size of parameter block in bytes rounded up to 64 


04 [mPH/VLine] or Execution address of 1Core HV linedraw function 
[mPDH/Vline] 


108 _|[DestPhysBase]___| Physical base address of fame buffer bitmap 
|| rxopixetadde) | XOpixel address 
[Y 1PixelAdar 
| |rxapixetAdde] | Xt pixel address 
[ClipTop}:{ClipLeft 


18 [ClipBot]:[ClipRight] | ClipRegion bottom pixel addr (in hi word) and right pixel 
addr 


4 |[Foregrountcolouw} | 
28_| [BackeroundColowr) | 
A value between 0 and 4, where 0=Is_solid, 1=Is_dashed, 


2¢ PatternNumber 
2=ls_ dotted, 3=Ils_dotdashed, 4=ls_dashdotdot 


General Case Solid Lines 


N 


The Bresenham midpoint line algorithm used by the 2D core restricts lines to solid (non- 
styled) lines with no ROPping, the execeptions to which are the R2_ BLACK and 
R2_WHITE ROPs which require no pen (pattern) and the R2_ COPYPEN and 

R2_ NOTCOPYPEN ROPS which do. As such two distinct interfaces are provided by the 
primary core: 


(This is no longer true, we can now do arbitrary angle non-ROPped styled lines too, but no 
one seems to do’em so we haven’t provided an interface!) 


3.7.3 Bresenham Line with no Pen (BLine) 


Byte Command Buffer 
Offset Field (32bit wide) Comments 


0c_| [yoPixetAdde 
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|_| £xoPixetAaae 
[Y IPixel Add] 
|_| xaPixetAade 
[ClipTop]:{ClipLef 


18 [ClipBot]:[ClipRight] | ClipRegion bottom pixel addr (in hi word) and right pixel 
addr 
[ROP3] Ternary ROP code corresponding to binary ROP 
R2_ BLACK/WHITE 


1C 
Stride in bytes 


3.7.4 Bresenham Line with Pen (PBLine) 


Byte Command Buffer 


Offset Field (32bit wide) Comments 


6 
08 | [DestPhysBase] __| Physical base address of frame buffer bitmap 
|_| rxopixetadar) | XOpixeladdress Cd 
10 _| tviPixetaddry 
|| pxpixetaddr| | Xt pixeladdress 
[ClipTop}:[ClipLeft] 


18 [ClipBot]:[ClipRight] | ClipRegion bottom pixel addr (in hi word) and right pixel 
addr 
] 
] 


10 

R2_(NOT)COPYPEN 
24 | tForegroundCotou} | 
28 | BackgroundColou} | 


2c PatternNumber A value between 0 and 4, where 0=Is_solid, 1=ls_dashed, 
2=Is_dotted, 3=Ils_dotdashed, 4=ls_dashdotdot 


Polyline Drawing 
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The 1Core polyline function suffers from the 2D Core Bresenham line draw restrictions and 
provides only drawing of solid (or styled! See above) lines using R2_ (NOT)COPYPEN. 


3.7.5 Bresenham Polyline with Pen (PPLine) 


Byte Command Buffer 
Offset Field (32bit wide) Comments 


a 
NumPoints) (Rounded up to next 64) 

08___| [DestPhysBase]__| Physical base address of frame buffer bitmap 

| [pxopixetaddr) | Xopixetaddress 

|| pxtixetadar] | Xt pixeladdress 

[ClipTop}:{ClipLet 


18 [ClipBot]:[ClipRight] | ClipRegion bottom pixel addr (in hi word) and right pixel 
addr 


[ROP3] Ternary ROP code corresponding to binary ROP 
R2_(NOT)COPYPEN 
Stride Stride in bytes 


A value between 0 and 4, where 0=Is_solid, 1=Is_dashed, 
2=Is_dotted, 3=ls_dotdashed, 4=Ils_dashdotdot 


30 NumPoints Total number of points -2 (ie excluding the 2 points 
passed in OxC and 0x10) 


34 [YnPixelAddr] Yn pixel address 
[ 
tj 8 [ 


| XnPixelAddr]| Xn pixel address 


Yn+1PixelAddr Yn+1 pixel address 


38 |__| Yet pixeladdress 
| [Xn+1PixelAddr] Xn+1 pixel address 
cto | ete 


3.8 Geometric Shape Drawing 
Windows 95/NT specifies its ellipses and circles via their bounding rectangles. If the 


rectangles are composed of two (x,y) pairs, (x0,y0) and (x1,y1) representing the top-left and eS 
bottom-right vertices of the rectangle, then we define: 
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a = (x1-x0)/2 and b = (yl-y0)/2 
ie. a=half width, b=half height. We pass these values, and the origin, rather than the 
bounding rectangles. 


The primary core uses DDA algorithms for its circles and ellipses, the error terms and 
constants of which involve powers of the a and b values defined above. These terms are 
generated by the host and included in the parameter set sent to the 1Core. 


3.8.1 Hollow Ellipse/Circle 
Byte Command Buffer 


Offset Field (32bit wide) Comments 


C_| {Stride}: [ROP 
0 


14 [ClipBot]:[ClipRight] ClipRegion bottom pixel addr (in hi word) and right pixel 
addr 


& 


i) 


1ve) (oS) Loe) we) NO iw) iw) _— —" — oO 
oe) & S Q o - j=) Q Co & 


3c [X origin] 


3.8.2 Are 


TBD 
3.8.3 Hollow Pie 


TBD 
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3.8.4 Hollow Chord 


TBD 


3.8.5 Filled Ellipse/Circle 
Byte Command Buffer 


Offset Field (32bit wide) Comments 


04 
08 _| fDestPhysBase]___| Physical base address of frame buffer bitmap 
: 
in upper 8 bits of low word, ternary ROP in lower 8 bits 


10 | [ClipTop]:[Cli 
14 [ClipBot]:[ClipRight] ClipRegion bottom pixel addr (in hi word) and right pixel 
addr 


(F2] 
Colour of the border 
2 y asar] 
* a 


2 x bsqr] 


*a 
Y origin] 
a 


2*a*?b*b 
a 


[ 
peatbth 
(al-(b 
[ 
[ 


[Bsquare] b*b 

(EI) 

[X origin] 
4e__| {pattem 1 colour 


3.8.6 Filled Pie 


TBD 
3.8.7 Filled Chord 


TBD 
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3.8.8 Round Rectangle 


TBD 
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The following table relates the primary core bitblt function to the given GDI ROP code and 
pattern and source type: 


ROP 


00 Blackness No Source 


01 DPSoon 


02 DPSona 


03 PSon 


04 SDPona 


05 DPon 


No Pattern 
Bit 


Mono Source 


Colour Source 


No Source 
Mono Source 


Colour Source 


No Source 
Mono Source 


Colour Source 


No Source 
Mono Source 


Colour Source 


No Source 
Mono Source 


Colour Source 


No Source 


Mono Source 


Colour Source 
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Mono PatternSolid Pattern Colour Pattern 


mSmPDBIt 
cSmPDBIt 


mSmPDBIt 
cSmPDBIt 


mSmPBIt 
cSmPBIt 


mSmPDBIt 
cSmPDBIt 


mPDBIt 
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mSsPDBIt 
cSsPDBIt 


mSsPDBIt 
cSsPDBIt 


mSsPBIt 
cSsPBIt 


mSsPDBIt 
cSsPDBIt 


sPDBIt 


mScPDBIt 
cScPDBIt 


mScPDBIt 
cScPDBIt 


mScPBIt 
cScPBIt 


mScPDBIt 


cScPDBIt 


cPDBIt 
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06 PDSxnon No Source 


Mono Source mSmPDBIt mSsPDBIt mScPDBIt 

Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
07 PDSaon No Source 

Mono Source mSmPDBIt mSsPDBIt mScPDBIt 

Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
08 SDPnaa No Source 

Mono Source mSmPDBIt mSsPDBIt mScPDBIt 

Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
09 PDSxon No Source 

Mono Source mSmPDBIt mSsPDBIt mScPDBIt 

Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
OA DPna No Source mPDBIt sPDBIt cPDBIt 

Mono Source 

Colour Source 
0B PSDnaon No Source 

Mono Source mSmPDBIt mSsPDBIt mScPDBIt 

Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
OC SPna No Source 

Mono Source mSmPBIit mSsPBIt mScPBIt 

Colour Source cSmPBIt cSsPBIt cScPBIt 
0D PDSnaon No Source 

Mono Source mSmPDBIt mSsPDBIt mScPDBIt 

Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
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OE PDSonon No Source 
Mono Source 
Colour Source 
OF Pn No Source 
Mono Source 
Colour Source 
10 PDSona No Source 
Mono Source 
Colour Source 
11 DSon No Source 
Mono Source 
Colour Source 
12 SDPxnon No Source 
Mono Source 
Colour Source 
13 SDPaon No Source 
Mono Source 
Colour Source 
14 DPSxnon No Source 
Mono Source 
Colour Source 
15 DPSaon No Source 
Mono Source 


Colour Source 
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mSmPDBIt mSsPDBIt 
cSmPDBIt cSsPDBIt 
mPBIt sPBIt 
mSmPDBIt mSsPDBIt 
cSmPDBIt cSsPDBIt 

mSDBIt 

cSDBIt 
mSmPDBIt mSsPDBIt 
cSmPDBIt cSsPDBIt 
mSmPDBIt mSsPDBIt 
cSmPDBIt cSsPDBIt 
mSmPDBIt mSsPDBIt 
cSmPDBIt cSsPDBIt 
mSmPDBIt mSsPDBIt 
cSmPDBIt cSsPDBIt 
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mScPDBIt 
cScPDBIt 


cPBIt 


mScPDBIt 
cScPDBIt 


mScPDBIt 
cScPDBIt 


mScPDBIt 
cScPDBIt 


mScPDBIt 
cScPDBIt 


mScPDBIt 
cScPDBIt 
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16 PSDPSanaxx No Source 

Mono Source mSmPDBIt mSsPDBIt mScPDBIt 

Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
17 SSPxDSxaxn No Source 

Mono Source mSmPDBIt mSsPDBIt mScPDBIt 

Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
18 SPxPDxa No Source 

Mono Source mSmPDBIt mSsPDBIt mScPDBIt 

Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
19 SDPSanaxn No Source 

Mono Source mSmPDBIt mSsPDBIt mScPDBIt 

Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
1A PDSPaox No Source 

Mono Source mSmPDBIt mSsPDBIt mScPDBIt 

Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
1B SDPSxaxn No Source 

Mono Source mSmPDBIt mSsPDBIt mScPDBIt 

Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
1C PSDPaox No Source 

Mono Source mSmPDBIt mSsPDBIt mScPDBIt 

Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
1D DSPDxaxn No Source 

Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
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Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
1E PDSox No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
1F PDSoan No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
20 DPSnaa No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
21 SDPxon No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
22 DSna No Source 
Mono Source mSDBIt 
Colour Source cSDBIt 
23 SPDnaon No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
24 SPxDSxa No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
25 PDSPanaxn No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
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cSmPDBIt cSsPDBIt cScPDBIt 


Colour Source 


26 SDPSaox No Source 
Mono Source 


Colour Source 


27 SDPSxnox No Source 
Mono Source 


Colour Source 


28 DPSxa No Source 
Mono Source 


Colour Source 


29 PSDPSaoxxn No Source 
Mono Source 


Colour Source 


2A DPSana No Source 
Mono Source 


Colour Source 


2B SSPxPDxaxn No Source 
Mono Source 


Colour Source 


2C SPDSoax No Source 
Mono Source 


Colour Source 


2D PSDnox No Source 


Mono Source 
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mSmPDBIt mSsPDBIt mScPDBIt 
cSmPDBIt cSsPDBIt cScPDBIt 


mSmPDBIt mSsPDBIt mScPDBIt 
cSmPDBIt cSsPDBIt cScPDBIt 


mSmPDBIt mSsPDBIt mScPDBIt 
cSmPDBIt cSsPDBIt cScPDBIt 


mSmPDBIt mSsPDBIt mScPDBIt 
cSmPDBIt cSsPDBIt cScPDBIt 


mSmPDBIt mSsPDBIt mScPDBIt 
cSmPDBIt cSsPDBIt cScPDBIt 


mSmPDBIt mSsPDBIt mScPDBIt 
cSmPDBIt cSsPDBIt cScPDBIt 


mSmPDBIt mSsPDBIt mScPDBIt 
cSmPDBIt cSsPDBIt cScPDBIt 


mSmPDBIt mSsPDBIt mScPDBIt 
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Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
2E PSDPxox No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
2F PSDnoan No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
30 PSna No Source 
Mono Source mSmPBIt mSsPBIt mScPBIt 
Colour Source cSmPBIit cSsPBIt 
cScPBIt 
31 SDPnaon No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
32 SDPSoox No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
33 Sn No Source 
Mono Source mSBIt 
Colour Source cSBIt 
34 SPDSaox No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
35 SPDSxnox No Source 
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36 SDPox 


37 SDPoan 


38 PSDPoax 


39 SPDnox 


3A SPDSxox 


3B SPDnoan 


3C PSx No Source 
Mono Source 
Colour Source 
cScPBIt 


Mono Source 


Colour Source 


No Source 
Mono Source 


Colour Source 


No Source 
Mono Source 


Colour Source 


No Source 
Mono Source 


Colour Source 


No Source 
Mono Source 


Colour Source 


No Source 
Mono Source 


Colour Source 


No Source 
Mono Source 


Colour Source 
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mSmPDBIt mSsPDBIt mScPDBIt 
cSmPDBIt cSsPDBIt cScPDBIt 
mSmPDBIt mSsPDBIt mScPDBIt 
cSmPDBIt cSsPDBIt cScPDBIt 
mSmPDBIt mSsPDBIt mScPDBIt 
cSmPDBIt cSsPDBIt cScPDBIt 
mSmPDBIt mSsPDBIt mScPDBIt 
cSmPDBIt cSsPDBIt cScPDBIt 
mSmPDBIt mSsPDBIt mScPDBIt 
cSmPDBIt cSsPDBIt cScPDBIt 
mSmPDBIt mSsPDBIt mScPDBIt 
cSmPDBIt cSsPDBIt cScPDBIt 
mSmPDBIt mSsPDBIt mScPDBIt 
cSmPDBIt cSsPDBIt cScPDBIt 
mSmPBIt mSsPBIt mScPBIt 
cSmPBIit cSsPBIt 
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3D SPDSonox No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
3E SPDSnaox No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
3F PSan No Source 
Mono Source mSmPBIit mSsPBIt mScPBIt 
Colour Source cSmPBIt cSsPBIt 
cScPBit 
40 PSDnaa No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
41 DPSxon No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
42 SDxPDxa No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
Colour Source cSmPDBIt cSsPDBIt cScPDBIt 
43 SPDSanaxn No Source 
Mono Source mSmPDBIt mSsPDBIt mScPDBIt 
Colour Source cSmPDBIt cSsPDBIt cScPDBIt 


44 SDna No Source 
Mono Source mSDBIt 


Colour Source cSDBIt 
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45 DPSnaon No Source 
Mono Source 


Colour Source 


46 DSPDaox No Source 
Mono Source 


Colour Source 


47 PSDPxaxn No Source 
Mono Source 


Colour Source 
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